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Abstract 

We look at the evolution through rewiring of the degree distribution of a network 
so the number edges is constant. This is exactly equivalent to the evolution of prob- 
ability distributions in models of cultural transmission with drift and innovation, or 
models of homogeneity in genes in the presence of mutation. We show that the mean 
field equations in the literature are incomplete and provide the full equations. We 
then give an exact solution for both their long time solution and for their approach to 
equilibrium. Numerical results show these are excellent approximations and confirm 
the characteristic simple inverse power law distributions with a large scale cutoff under 
certain conditions. The alternative is that we reach a completely homogeneous solu- 
tion. We consider how such processes may arise in practice, using a recent Minority 
Game study as an example. 

Introduction 

The observation of power law probability distribution functions for things as diverse as 
city sizes, word frequencies and scientific paper citation rates has long fascinated people. 
Yule, Zipf, Simon and Price [2 EJ El provide some, but by no means all, of the classic 
examples. Coupled with modern ideas of critical phenomena and self-organised criticality 
(see jB] for an introduction) this might suggest that such power laws reflect fundamental, 
perhaps inviolable, processes behind human behaviour. These are the modern expositions 
of ideas that have captivated for centuries as exemplified by Thomas Hobbes 1 . So when 
power laws are mixed with modern icons such as the World Wide Web [HI 02] we have an 
intoxicating mixture. 

In the context of complex networks, the focus is usually on power laws in the degree 
distributions, the property which defines a 'scale-free' network (see |10| for a review and 

"WWW: http:/ /www. imperial. ac.uk/people/t.evans 

1 Thomas Hobbes (1588-1679) was a philosopher who held that Human beings are physical objects, 
sophisticated machines whose functions and activities can be described and explained in purely mechanistic 
terms — "The universe is corporeal; all that is real is material, and what is not material is not real" \7\. 
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references). In growing networks, if one connects new vertices to existing vertices chosen 
with a probability proportional to their degree (at least this is the dominant behaviour for 
large degree) — 'preferential attachment' [SJ [§] — then the degree distribution for large 
degree k is of the form /c~ 7 with a power 7 greater than two. 

However, note that the degree distribution of a network is an ultra-local property of 
its vertices. The neighbours of a vertex at the other end of the edges play no role, it 
does not matter that the edge describes some bilateral relationship between two vertices. 
This should not be surprising, the older studies such as Simon and Price [HI HI make 
no reference to a network. One may be added easily to their examples and models but it 
is not necessary. Conversely, one need not refer to the network of the World Wide Web, 
one can just count links on a page. For this reason the model of Simon [2] and that of 
Barabasi, Albert and Jeong jHUH] are identical despite the fact the latter refer to a network, 
the former does not. Thus it is only for convenience that in this paper we will use the 
language of complex networks 2 . One may easily dispense with the network as do many of 
our references. 

We start by observing that there is much less material on the degree distribution of 
networks which don't grow and their non-network counterparts. The model of Watts and 
Strogatz 15 is a primary example of a non-growing network, but it does not produce a 
power law. We will focus on models of non-growing networks where the number of edges 
E is constant (and non-network equivalents) with power law degree distributions. The 
point is that these power laws are invariably simple inverse powers of degree, n(k) oc 1/k, 
and so quite distinct from those found in most growing models. 

Such non-growing models have also been shown to be relevant to a wide range of 
examples. For instance it has been used when considering the transmission of cultural 
artifacts: the decoration of ceramics in archaeological finds |161 1171 ITS] , the popularity of 
first names |19j . dog breed popularity |20j . the distribution of family names in constant 
populations j^. Similar models have been used to study the diversity of genes [22! • The 
same types of probability distribution have also been seen in a study of the Minority Game 
|23| and our model suggests why such features emerge even when there is no explicit 
scale-free network. Such rewiring models have also been studied for their own merits 
[HJI23]. For definiteness in this paper we will use the language of cultural transmission 

m nam mm]. 

The Model 

Consider a simple bipartite model 3 with E 'individual' vertices, each with one edge 4 
running to any one of ./V 'artifact' vertices, as shown in Figure^. The degree of the artifact 
vertices is k indicating that one artifact has been 'chosen' by k distinct individuals. The 
rewiring will be of the artifact ends of the edges, so each individual is always connected to 
the same edge. It is the degree distribution of the artifact vertices which we will consider 
so n(k) is the number of artifacts each of which has been chosen by k individuals. The 
probability distribution of interest is then p(k) = n(k)/N. 

At each time step we make two choices but initially no changes to the network. First 
we choose an individual at random 5 and consider its single link. This is equivalent to 

2 There are some suggestions about how such a power law might emerge only because of the network 
structure [TTl[T21[r31[H] . 

3 We have considered other types of network and the generalisations are straightforward. 

4 The degree of the 'individuals' does not effect the derivations and is only relevant to the interpretation. 

5 We adopt the common convention that 'random' without further qualification indicates that a uniform 
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Figure 1: Illustration of the model. 

choosing an edge at random. It is also equivalent to picking an artifact vertex with pure 
preferential attachment, that is with probability proportional to its degree. We indicate 
the probability of choosing a particular artifact at this stage as H R since we are going to 
remove this edge from this artifact. 

The edge chosen is going to be attached to another artifact vertex picked with proba- 
bility 11,4. This is the second choice we have to make and it will be done with a mixture 
of preferential attachment and random artifact vertex choice. In a fraction p r of the at- 
tachment events we chose a random artifact vertex to receive the rewired edge. In the 
context of studies of cultural transmission [T^J f^J [^J [^J |2Tj] this corresponds to innova- 
tion, while in gene evolution it is mutation |22| . Alternatively with probability p p we use 
preferential attachment to find a new artifact vertex for attachment. This is copying of the 
choice previously made by another individual, drift in the work on cultural transmission 
|16[ I17[ I18[ I19[ 120] , while it is the inheritance mechanism in models of gene [22] or family 
name [2^ homogeneity. If these are the only types of event p p + p r = 1, the number of 
artifacts N is constant and 

U R = ^ V-A=Pr-^+P P ^, (0<k<E). (1) 

Note that there is a chance IIrII^ that we will choose the same artifact vertex for both 
attachment and removal and there will then be no change in the network. 

Finally, once both the artifacts for edge removal and addition have been picked, we 
perform the rewiring. The mean field equation for evolution of n(k) is then j^H] 

n(k,t+ 1) -n(k,t) 

= n(k + l,t)U R (k + l,t)(l-U A {k+l)) 

-n(k, t)U R (k, t) (1 - U A (k)) - n(k, t)U A (k, t) (1 - U R (k)) 
+n(k-l,t)U A (k-l,t)(l-U R (k-l)), (0 < k < E) (2) 

We must set n(k) = H R (k) = n^(A;) = for k = — 1 and k = E + 1 to ensure this equation 
is valid at the boundaries k = and k = E. It is crucial that we include the factors of 
(1 — Tl A ) and (1 — H R ) otherwise the behaviour at the boundaries is incorrect. We are 

distribution is used to draw from the set implicit from the context. 
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explicitly excluding events where the same vertex is chosen for removal and attachment 
in any one rewiring event as they do not change the network but they are likely only if 
n(k ~ E) = 1. Such (1 — II) terms are missing from other discussions of such models but 
the literature usually has n(k ~ E) <C 1 so these factors are negligible. Thus the results 
in the literature will be approximately the same as ours in this regime. 

We can rephrase this as a Markov process. Consider a vector n(t) where ni(t) = 
n(E + 1 — i, t) for % = 1, 2, ...,_£? + 1. Then we can think of the equations (|2|) as 



n(t + 1) 



Mn(i). 



The transition matrix is 



(g{E) h(E-l) 
f(E) g(E-l) h{E-2) 
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where the matrix entries are specified by the functions 



The evolution is then given by the eigenvectors and eigenvalues of M 

E+l 

n(t) = qeW + £ c t \\ 



1 = Ai > A 2 > ... > I A,- 1 > A 



> 



j=2 



Me^ = Aj-e^ j = 1, 2, . . . (E + 1) 



(3) 



(4) 



f(k) 


= U R (k)(l-U A (k)) 


(5a) 


g(k) 


= 1 - f(k) - h(k) 


(5b) 


h(k) 


= U A (k)(l-U R ) 


(5c) 



(6) 
(7) 



Stationary Solution 

The stationary solution for the degree distribution n(k,t) = n(k), the eigenvector associ- 
ated with the largest eigenvalue Ai = 1, can be found by substituting n(k,t) = n(k) into 
the evolution equation @. We then note that if the first and third lines are equal then so 
are the second and third lines. Thus we look for solutions of the form 



n(k + l)U R (k + 1, t) (1 - Ii A {k + 1)) = n(k)U A {k, t) (1 - U R (k)) . 
The result is [2H 

E 



m _ 4(1) V _ PP \_H PP W ^ 

n W~ A r(jfe + i) v(E + i-k) 



(E>k>0) 



(8) 



0) 



where A is a constant normalisation and the average degree is (k) = E/N . This solution 
has two characteristic parts. The first ratio of Gamma functions for E 3> k 3> 1 behaves 
as 

Y(k + —(k)) 

-TflETir KV "< 1+0( *" 1 »' (10) 



For p r E = Np p we have an exact inverse power law. The power is always below one but 
for many values (p r <C 1) the power is close to one. This is very different from the results 
for simple models with growth in the number of edges where demanding that the first 
moment is finite, (k) < oo, requires 7 > 2. 

However the (1 — ILa) and (1 — Hr) terms in (j2j) have led to the second ratio of Gamma 
functions which \ip v k <C E gives an exponential cutoff 

»~» <fc> ~*) oc exp{-^}(l + 0(^)), k^(-- P A, (11) 



Y{E + l-k) y E' n \p p p Pi 

2pr(k) 
E 

Pr ii Pr < 1, (k) « £ . (13) 



ln(p p )-^ (12) 



However the numerator of this second ratio of Gamma functions becomes very large for 
k = E if (E — p r (k)) <C Pp. So this happens if p r <S TV.simpiej where 

Pr , lmpl « = ¥T i- w 4 + «^ +0(£ -). (14) 

This spike at k = E will dominate the degree distribution. The point where the distribu- 
tion has become flat at the upper boundary, so n{E) = n(E — 1) defines another critical 
random attachment probability p rjC rit at 

E-l ( . 

Prcrit - E 2 + E{1 _ {k)) _ 1 _ {k) W 

E Prcli t « l+ (<fc) ~ 2) (16) 
hi 

Thus when p r < 1 / E the degree distribution will show a spike at k = E. 

Overall we see two distinct types of distribution. For large innovation or mutation 
rates, Ep r > 1, we get a simple inverse power with an exponential cutoff 

n(k)<x(k)-yeM-Ck}, Pr>^ (17) 

This is the behaviour noted in the literature [2U |22l 1201 EHj and since Ep r > 1 the 
formulae of the literature for the power 7 and cutoff £ are a good approximation to the 
exact formulae given here. Note that in any one practical example it will be impossible 
to distinguish the power law derived from the data from 7 = 1. The power drifts away 
from one as we raise the innovation rate p r towards one but only at the expense of the 
exponential regime starting at lower and lower degree. That is, only when the power is 
very close to one can we get enough of a power law to be significant. 

However as p r is lowered towards zero we get a change of behaviour in the exponential 
tail around p r E ~ 1. First we find the exponential cutoff moves to larger and larger 
values, eventually becoming bigger than E. In fact this second ratio of Gamma functions 
becomes equal to one for all k at p r = fV iS i mp ie ~ E^ 1 . At that value of p r = p rjS i m pie 
we have no cut off and we are closest to an exact inverse power law for all degree values. 
Slightly below that value of p r the tail starts to rise for p r > p rjC rit 

w E' 1 . For p r E <C 1, 

i.e. if there has been no random artifact chosen after most edges have been rewired once, 
then we will almost certainly find one artifact linked to most of the individuals, n(E) ~ 1. 
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Figure 2: Plots of the degree distribution normalised with respect to k = 1 and the 
fractional error of the data w.r.t. the exact solution. For N = E = 100 and various 
p r = (1 — p p ) = 0.1 (crosses), 0.01 (circles), 0.005 (stars) and 0.001 (squares), while lines 
are the exact solutions. Measured after 10 5 rewiring events, averaged over 10 runs. The 
size of fluctuations are clear from the deviations about the exact solutions. 

These exact results for the degree distribution are for the mean field equations. These 
are only approximations but because in these models there are no correlations between 
vertices, they should be excellent approximations. Simulations confirm this as Figures [2] 
and [21 show. 

The Generating Function 

Given the exact solution for the degree distribution Q its generating function G(z), where 

E 



G(z):=Y,n(k)z k , (18) 



k=0 



is found to be 



G(z) = n(0)F(K, —E; 1 + K — E — E; z) (19) 

K = ^(k), E = ^E. (20) 
P P P P 

Here F is the Hypergeometric function. The average fraction of sites of zero degree in the 
mean-field calculation is then exactly 

"(0) = r(l-E-E)F(l + K-E) 
N r(l - E + K - E)T(l - E) ' 

The m-th derivative of the generating function is 



1 d m G(z) 



G(l) dz r 



„ . T(K + m)T(-E + m)T(l - E - m) , , 

= (k(k-l) . . . (jfe-m+1)) = — — 1 — 4^ 1 y ~ '-. (22) 

*=i r(K)T(-E)T(l - E) 



Knowing all the derivatives up to order m gives all the moments (k n ) up to that order. For 
case m = 1 this provides a consistency check on the parameter (k) = E/N - the average 
artifact degree. 
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Figure 3: The degree distribution normalised to n(l) and the fractional error w.r.t. the 
exact solution for N = E, Ep r = 10.0 and p r = 10~ 2 (crosses), 10~ 3 (circles) and 10 -4 
(stars). Measured after 10 7 rewiring events, averaged over 10 3 runs. The tails of the 
distributions have few data points and so they show larger fluctuations about the the 
mean field results, as seen on the right. Despite this, for p r = 10~ 4 there are clear signs 
we have not quite reached equilibrium. Started with n{k = 1) = E but otherwise zero. 

New Artifact Addition 

The cultural transmission models [161 1171 HUH 1181 120j . the gene evolution model of [22^ and 
the model of family name distributions [2^ include another attachment process. There a 
new artifact vertex is added to the network with probability p = 1 — p r — p p . The new 
artifact receives the edge removed from an existing artifact on the same time step. In 
this case the number of artifacts becomes infinite so most artifacts have no edges. Then 
the random attachment becomes completely equivalent to this new process of artifact 
addition. Thus the large N, zero (k) limit of our equations reproduces this case 6 . The 
degree distribution for k > 1 behaves exactly as above — a simple inverse degree power 
law cutoff by an exponential for E(l — p p ) > 1 while for E(l — p p ) < 1 n(E) ~ 1. In this 
model though, when p r = p r>s i mp i e we have a degree distribution which is an exact inverse 
power law for the whole range of non-zero degrees. Our exact solutions to the mean field 
equations again fits the data as Figure 0] shows. 

Equilibration Rate 

So far we have studied only the long time equilibrium distribution of and therefore 
the first eigenvector of the matrix M of ||f associated with eigenvalue 1. However in 
Figure El there is clear evidence that the system has not yet reached equilibrium despite 
the apparently large number of rewiring events (each edge will have been rewired about 
10 times and results were averaged over 100 runs). This should be due to the second 
largest eigenvalue A2 in ©. We conjecture that this is of the form 7 

A 2 = l-|. (23) 

6 Since n(0) diverges this must be excluded from discussions, but this is straightforward. An alternative 
normalisation is needed, such as the number of 'active' artifacts Na = 53f=i n {k). 

7 We have subsequently proved this conjecture |27|. In fact all the eigenvectors and eigenvalues of M of 
have a distinctive pattern which we will report on elsewhere |27| . 
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Figure 4: Plots of the degree distribution normalised with respect to k = 1 and the 
fractional error of the data w.r.t. the exact solution. For E = 100 but with new artifacts 
added with probability p r = 0, p = 1 — p p where p = 0.1 (crosses), 0.01 (circles), 0.005 
(squares) and 0.001 (stars). The lines are the relevant exact mean field solutions. Measured 
after 10 5 rewiring events, and averaged over 10 4 runs. 



We have shown that this is always an eigenvalue of the transition matrix M @ and checked 
numerically that this is indeed the second largest eigenvalue for E < 100. The eigenvector 
with this eigenvalue is 

{2) = , (8) r (k - (k) + 1) r (fc + (t^t <*)) r - ^ (k) - k) 

6 E+i~k T(k-(k)) T{k + l) T(E + l-k) ' 1 ' 

This means that the equilibration time scale — the time taken for contributions from 
eigenvectors (j > 2) to die away — is 

1 EE 2 

(25) 



ln(A( 2 )) Pr (p r E) 

The parameter which controls the shape of the distribution for most examples is (p r E) 
and we expect to see the same shape independent of E. However what is noticable is 
that the rate of convergence slows as E 2 as we increase E for fixed p r E. This is visible in 
Figure [3 Figure El shows the results are consistent with our prediction in (|25jl. 



Conclusions 

We have analysed the degree distribution in rewiring network models and equivalent mod- 
els which make no reference to a network. We have shown that the mean field equations 
are different from the ones considered in the literature. This makes little difference for 
results quoted when p r E ^> 1 but we have demonstrated that only with the extra terms in 
(J2J do we get the correct solution for all values of p r E. Further we have found the second 
eigenvalue and its eigenvector and thus deduced the rate of convergence to the equilibrium 
solution. This scales as E 2 for fixed p r E. 

The literature suggests that probability distributions with a simple inverse size form 
plus a large scale cut off, as found in these models, are common. The real question is 
whether it is the copying mechanism which leads to such distributions in practice? It 
is difficult to understand why in the real world individuals choose a new artifact with a 
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Figure 5: The degree distribution for N = E = 10 3 for (p r E) = 10 (left) (p r E) =0.1 
(right), with p p = 1 — p r , p = 0, with the degree distribution taken at a variety of times: 
5 x 10 3 (squares), 5 x 10 4 (stars), 5 x 10 5 (circles) and 5 x 10 6 (crosses). The initial 
configuration had each artifact connected to one individual and results were averaged over 
100 runs. With (p r E) = 10 we are close to equilibrium after t = 5 x 10 3 steps while this 
is only roughly true after t = 5 x 10 5 time steps for (p r E) = 0.1, consistent with our 
prediction that the equilibration time scale is r = E/p r . 

probability exactly equal to the number of times that artifact has been previously chosen, 
preferential attachment. It is known that for growing networks deviations from this law 
lead to deviations from power law distributions |2H]- Surely in the real world, many 
decisions would be influenced by certain 'leaders' in their fields and we are more likely to 
copy their decision than that of other individuals? 

In fact copying the choice of others, including that of certain 'leaders' may emerge 
naturally. Suppose our individuals were connected to each other by a second network, 
a 'contacts' network. Individuals could use their contacts by copying the advice of a 
friend or a friend of a friend as defined by the network of contacts. This is equivalent to 
making a finite length random walk on the graph of contacts. For growing networks this is 
known to be a way that the structure of the graph can self-organise into a scale-free form 
|11[ 1121 11M| 114] even if the random walk is only one step long. In a similar way, for non- 
growing networks, we are essentially making a one-step walk on the bipartite graph between 
individuals and artifacts, regardless of any network between the individuals. Extrapolating 
the results of [II] to the non-growing case suggests that this should be sufficient to generate 
an effective attachment probability of the form Put simply we expect that whatever 
we do, the probability of arriving at one artifact at the end of a random walk is going to 
be dominated by the number of routes into that artifact, i.e. its degree. 

Such an example may be seen in the the model of the minority game by Anghel 
et al. [2Hj. Their individuals are connected by a random graph and at each time step an 
individual copies the best strategy (the artifact in this case) from amongst the strategies of 
their neighbouring contacts. The individuals do not choose a random neighbour's artifact 
but the 'best' artifact. However if the meaning of best is always changing, as it may be 
in the Minority Game or in many examples of fashion, this best choice may be effectively 
a random neighbour choice and hence be statistically equivalent to the simple copying 
used in our models. Thus even if it appears that the population is influenced by wise men 
or fashion leaders, provided there is little substance to their choices then it may well be 
equivalent to simple copying of one person's choice. It should come as no surprise that 
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the results for the artifact degree distribution in [23] , the popularity of the most popular 
strategies, follows a simple inverse power law with a large degree cutoff, exactly as the 
simple copying model would give. 

Finally one might ask if it is important to get the right classification of artifacts to see 
the distribution. What if people make choices based on one classification but we measure 
on another? Do people choose a specific breed of dog as registered by the dog breeders 
association of their country, or do they really choose between small and large dogs, short 
or long haired dogs [20] ? The classification of pottery in archaeology is one imposed on the 
record by modern archaeologists. The answer ought to be that the classification should 
not be important and it is a scaling property of the model and its solutions that this is so. 

Suppose we randomly paired all the artifacts but deemed the fundamental process 
to be based still on the choice of the original artifacts and their degree. The choice of 
edge to remove is unchanged while preferential attachment to the underlying individual 
artifacts leads to effective preferential attachment to the artifact pairs. The probability 
of an innovation event (p r or p) is unchanged but the probability of choosing a random 
artifact pair is double that of choosing a single artifact. However that reflects the fact 
that the number of artifacts pairs N2 is half the original number of artifacts. Thus we see 
that we require that N — > N2 = (N/2) but we keep all other parameters the same. In 
particular the linear nature of preferential removal and attachment to the artifact pairs 
means that the form of both removal and attachment is unchanged. Overall we have 
exactly the same equations for the artifact pair degree distribution ri2(k) as we did for the 
original artifacts. Thus the distribution n(k) of ® is of the same form with N — > {N/2) 
being the only change required. However we have seen that for Ep r > 1 the shape can 
be parameterised in terms of a power 7 (jlOj) and an exponential cutoff (" (|12[). The latter 
is unchanged and while the power does change a little, we have argued that if 7 can be 
measured, it is likely to be indistinguishable from one in any real data set. So apart from 
the overall normalisation, the distribution of artifact choice is essentially independent of 
how we choose to classify the artifacts. This stability against the classification of the 
artifact types is an important feature of the copying models considered here. 

TSE would like to thank H.Morgan and W.Swanell for useful discussions. 
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Figure 6: Plots of the fractional error of the data of Fig. Elw.r.t. the exact solution for 
(p r E) = 10 (left) (p r E) = 0.1 (right). With N = E = 10 3 , p p = 1 - p r , p = 0, with the 
degree distribution taken at a variety of times: 5 x 10 3 (squares), 5 x 10 4 (stars), 5 x 10 5 
(circles) and 5 x 10 6 (crosses). The initial configuration had each artifact connected to 
one individual and results were averaged over 100 runs. With (p r E) = 10 we are close to 
equilibrium after t = 5 x 10 3 steps while this is only roughly true after t = 5 x 10 5 time 
steps for (p r E) =0.1, consistent with our prediction that the equilibration time scale is 
r = E/p r . 
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